An effective algorithm for hyperparameter optimization of neural networks
نویسندگان
چکیده
A major challenge in designing neural network (NN) systems is to determine the best structure and parameters for the network given the data for the machine learning problem at hand. Examples of parameters are the number of layers and nodes, the learning rates, and the dropout rates. Typically, these parameters are chosen based on heuristic rules and manually fine-tuned, which may be very time-consuming, because evaluating the performance of a single parametrization of the NN may require several hours. This paper addresses the problem of choosing appropriate parameters for the NN by formulating it as a box-constrained mathematical optimization problem, and applying a derivative-free optimization tool that automatically and effectively searches the parameter space. The optimization tool employs a radial basis function model of the objective function (the prediction accuracy of the NN) to accelerate the discovery of configurations yielding high accuracy. Candidate configurations explored by the algorithm are trained to a small number of epochs, and only the most promising candidates receive full training. The performance of the proposed methodology is assessed on benchmark sets and in the context of predicting drug-drug interactions, showing promising results. The optimization tool used in this paper is open-source.
منابع مشابه
Modeling and Optimization of Roll-bonding Parameters for Bond Strength of Ti/Cu/Ti Clad Composites by Artificial Neural Networks and Genetic Algorithm
This paper deals with modeling and optimization of the roll-bonding process of Ti/Cu/Ti composite for determination of the best roll-bonding parameters leading to the maximum Ti/Cu bond strength by combination of neural network and genetic algorithm. An artificial neural network (ANN) program has been proposed to determine the effect of practical parameters, i.e., rolling temperature, reduction...
متن کاملHardness Optimization for Al6061-MWCNT Nanocomposite Prepared by Mechanical Alloying Using Artificial Neural Networks and Genetic Algorithm
Among artificial intelligence approaches, artificial neural networks (ANNs) and genetic algorithm (GA) are widely applied for modification of materials property in engineering science in large scale modeling. In this work artificial neural network (ANN) and genetic algorithm (GA) were applied to find the optimal conditions for achieving the maximum hardness of Al6061 reinforced by multiwall car...
متن کاملThe Optimization of the Effective Parameters of the Die in Parallel Tubular Channel Angular Pressing Process by Using Neural Network and Genetic Algorithm Methods
One of reasons that researchers in recent years have tried to produce ultrafine grained materials is producing lightweight components with high strength and reliability. There are disparate methods for production of ultra-fine grain materials,one of which is severe plastic deformation method. Severe plastic deformation method comprises different processes, one of which is Parallel tubular chann...
متن کاملOn Hyperparameter Optimization in Learning Systems
We study two procedures (reverse-mode and forward-mode) for computing the gradient of the validation error with respect to the hyperparameters of any iterative learning algorithm. These procedures mirror two ways of computing gradients for recurrent neural networks and have different trade-offs in terms of running time and space requirements. The reverse-mode procedure extends previous work by ...
متن کاملThe Optimization of the Effective Parameters of the Die in Parallel Tubular Channel Angular Pressing Process by Using Neural Network and Genetic Algorithm Methods
One of reasons that researchers in recent years have tried to produce ultrafine grained materials is producing lightweight components with high strength and reliability. There are disparate methods for production of ultra-fine grain materials,one of which is severe plastic deformation method. Severe plastic deformation method comprises different processes, one of which is Parallel tubular chann...
متن کاملTraffic Signal Prediction Using Elman Neural Network and Particle Swarm Optimization
Prediction of traffic is very crucial for its management. Because of human involvement in the generation of this phenomenon, traffic signal is normally accompanied by noise and high levels of non-stationarity. Therefore, traffic signal prediction as one of the important subjects of study has attracted researchers’ interests. In this study, a combinatorial approach is proposed for traffic signal...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IBM Journal of Research and Development
دوره 61 شماره
صفحات -
تاریخ انتشار 2017